Yifan Zhao

TVIR: Building Deep Research Agents Towards Text-Visual Interleaved Report Generation

Jun 01, 2026
Via arXiv

The Right Inference Strategy Is All You Need: Nearly Training-Free Domain-Wise Inference for EgoCross Challenge

May 30, 2026
Via arXiv

RoboStressBench: Benchmarking VLM Robustness to Physical Visual Stress in Embodied Scenes

May 30, 2026
Via arXiv

Envisioning Beyond the Few: Disentangled Semantics and Primitives for Few-Shot Atypical Layout-to-Image Generation

May 29, 2026
Via arXiv

Nautilus: An Auto-Scheduling Tensor Compiler for Efficient Tiled GPU Kernels

Apr 16, 2026
Via arXiv

RiO-DETR: DETR for Real-time Oriented Object Detection

Mar 10, 2026
Via arXiv

WorldTree: Towards 4D Dynamic Worlds from Monocular Video using Tree-Chains

Feb 12, 2026
Via arXiv

LongCat-Flash-Thinking-2601 Technical Report

Jan 23, 2026
Via arXiv

Seeing through Light and Darkness: Sensor-Physics Grounded Deblurring HDR NeRF from Single-Exposure Images and Events

Jan 21, 2026
Via arXiv

Towards Unified Co-Speech Gesture Generation via Hierarchical Implicit Periodicity Learning

Dec 15, 2025
Via arXiv